Stillwater
Hybrid LSTM-Transformer Models for Profiling Highway-Railway Grade Crossings
Chatterjee, Kaustav, Li, Joshua Q., Ansari, Fatemeh, Munna, Masud Rana, Parajulee, Kundan, Schwennesen, Jared
Hump crossings, or high-profile Highway Railway Grade Crossings (HRGCs), pose safety risks to highway vehicles due to potential hang-ups. These crossings typically result from post-construction railway track maintenance activities or non-compliance with design guidelines for HRGC vertical alignments. Conventional methods for measuring HRGC profiles are costly, time-consuming, traffic-disruptive, and present safety challenges. To address these issues, this research employed advanced, cost-effective techniques and innovative modeling approaches for HRGC profile measurement. A novel hybrid deep learning framework combining Long Short-Term Memory (LSTM) and Transformer architectures was developed by utilizing instrumentation and ground truth data. Instrumentation data were gathered using a highway testing vehicle equipped with Inertial Measurement Unit (IMU) and Global Positioning System (GPS) sensors, while ground truth data were obtained via an industrial-standard walking profiler. Field data was collected at the Red Rock Railroad Corridor in Oklahoma. Three advanced deep learning models Transformer-LSTM sequential (model 1), LSTM-Transformer sequential (model 2), and LSTM-Transformer parallel (model 3) were evaluated to identify the most efficient architecture. Models 2 and 3 outperformed the others and were deployed to generate 2D/3D HRGC profiles. The deep learning models demonstrated significant potential to enhance highway and railroad safety by enabling rapid and accurate assessment of HRGC hang-up susceptibility.
Bayesian Joint Model of Multi-Sensor and Failure Event Data for Multi-Mode Failure Prediction
Fard, Sina Aghaee Dabaghan, Kim, Minhee, Deep, Akash, Lee, Jaesung
Modern industrial systems are often subject to multiple failure modes, and their conditions are monitored by multiple sensors, generating multiple time-series signals. Additionally, time-to-failure data are commonly available. Accurately predicting a system's remaining useful life (RUL) requires effectively leveraging multi-sensor time-series data alongside multi-mode failure event data. In most existing models, failure modes and RUL prediction are performed independently, ignoring the inherent relationship between these two tasks. Some models integrate multiple failure modes and event prediction using black-box machine learning approaches, which lack statistical rigor and cannot characterize the inherent uncertainty in the model and data. This paper introduces a unified approach to jointly model the multi-sensor time-series data and failure time concerning multiple failure modes. This proposed model integrate a Cox proportional hazards model, a Convolved Multi-output Gaussian Process, and multinomial failure mode distributions in a hierarchical Bayesian framework with corresponding priors, enabling accurate prediction with robust uncertainty quantification. Posterior distributions are effectively obtained by Variational Bayes, and prediction is performed with Monte Carlo sampling. The advantages of the proposed model is validated through extensive numerical and case studies with jet-engine dataset.
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
Bilal, Ahsan, Mohsin, Muhammad Ahmed, Umer, Muhammad, Bangash, Muhammad Awais Khan, Jamshed, Muhammad Ali
--This survey explores the development of meta-thinking capabilities in Large Language Models (LLMs) from a Multi-Agent Reinforcement Learning (MARL) perspective. The survey begins by analyzing current LLM limitations, such as hallucinations and the lack of internal self-assessment mechanisms. It then talks about newer methods, including RL from human feedback (RLHF), self-distillation, and chain-of-thought prompting, and each of their limitations. The crux of the survey is to talk about how multi-agent architectures, namely supervisor-agent hierarchies, agent debates, and theory of mind frameworks, can emulate human-like introspective behavior and enhance LLM robustness. By exploring reward mechanisms, self-play, and continuous learning methods in MARL, this survey gives a comprehensive roadmap to building introspective, adaptive, and trustworthy LLMs. Evaluation metrics, datasets, and future research avenues, including neuroscience-inspired architectures and hybrid symbolic reasoning, are also discussed. THE cognitive abilities, such as intelligence and creativity, have played a fundamental role in human discoveries and inventions. Understanding the relationship between these two cognitive abilities is important not only for the advancement of psychological theories but also for the improvement of educational practices [1]. However, researchers still hold different views on how intelligence and creativity interact, often leading to conflicting findings. A key question in this discourse is how intelligence enables structured problem-solving, while creativity fosters novel solutions that are essential for human cognition and artificial intelligence systems. Ahsan Bilal is with University of Oklahoma, Norman, OK, 73072, USA (e-mail: ahsan.bilal-1@ou.edu). Muhammad Ahmed Mohsin, Muhammad Umer are with Stanford University, Stanford, CA, 94305, USA (e-mail: muahmed, mumer@stanford.edu). Muhammad A wais Khan Bangash is with the School of Electrical and Computer Engineering, Oklahoma State University, Stillwater, OK, 74075 USA (e-mail: awais.bangash@okstate.edu). Muhammad Ali Jamshed is with University of Glasgow, G12 8QQ, Glasgow, UK (e-mail: muhammadali.jamshed@glasgow.ac.uk). Similarly, in problem-solving tasks, intelligence aids in analyzing constraints, while creativity allows for flexible and unconventional approaches. Moreover, the role of internal thought processes varies with task complexity. Simpler tasks require minimal reasoning, whereas more complex tasks demand deeper cognitive engagement. This principle extends to artificial intelligence, where more sophisticated models exhibit enhanced performance in tasks requiring higher-order thinking.
Adaptive Anomaly Recovery for Telemanipulation: A Diffusion Model Approach to Vision-Based Tracking
Wang, Haoyang, Guo, Haoran, Tao, Lingfeng, Li, Zhengxiong
Dexterous telemanipulation critically relies on the continuous and stable tracking of the human operator's commands to ensure robust operation. Vison-based tracking methods are widely used but have low stability due to anomalies such as occlusions, inadequate lighting, and loss of sight. Traditional filtering, regression, and interpolation methods are commonly used to compensate for explicit information such as angles and positions. These approaches are restricted to low-dimensional data and often result in information loss compared to the original high-dimensional image and video data. Recent advances in diffusion-based approaches, which can operate on high-dimensional data, have achieved remarkable success in video reconstruction and generation. However, these methods have not been fully explored in continuous control tasks in robotics. This work introduces the Diffusion-Enhanced Telemanipulation (DET) framework, which incorporates the Frame-Difference Detection (FDD) technique to identify and segment anomalies in video streams. These anomalous clips are replaced after reconstruction using diffusion models, ensuring robust telemanipulation performance under challenging visual conditions. We validated this approach in various anomaly scenarios and compared it with the baseline methods. Experiments show that DET achieves an average RMSE reduction of 17.2% compared to the cubic spline and 51.1% compared to FFT-based interpolation for different occlusion durations.
Bio-Skin: A Cost-Effective Thermostatic Tactile Sensor with Multi-Modal Force and Temperature Detection
Guo, Haoran, Wang, Haoyang, Li, Zhengxiong, Tao, Lingfeng
Tactile sensors can significantly enhance the perception of humanoid robotics systems by providing contact information that facilitates human-like interactions. However, existing commercial tactile sensors focus on improving the resolution and sensitivity of single-modal detection with high-cost components and densely integrated design, incurring complex manufacturing processes and unaffordable prices. In this work, we present Bio-Skin, a cost-effective multi-modal tactile sensor that utilizes single-axis Hall-effect sensors for planar normal force measurement and bar-shape piezo resistors for 2D shear force measurement. A thermistor coupling with a heating wire is integrated into a silicone body to achieve temperature sensation and thermostatic function analogous to human skin. We also present a cross-reference framework to validate the two modalities of the force sensing signal, improving the sensing fidelity in a complex electromagnetic environment. Bio-Skin has a multi-layer design, and each layer is manufactured sequentially and subsequently integrated, thereby offering a fast production pathway. After calibration, Bio-Skin demonstrates performance metrics-including signal-to-range ratio, sampling rate, and measurement range-comparable to current commercial products, with one-tenth of the cost. The sensor's real-world performance is evaluated using an Allegro hand in object grasping tasks, while its temperature regulation functionality was assessed in a material detection task.
Neural Models of Task Adaptation: A Tutorial on Spiking Networks for Executive Control
Kannan, Ashwin Viswanathan, Ganesan, Madhumitha
The ability to adapt and switch between tasks is a fundamental Empirical studies further established the prefrontal cortex aspect of cognitive flexibility, shaping decision-making (PFC) as a key region in task-switching, with experiments such and behavioral efficiency in dynamic environments. Taskswitching as the Wisconsin Card Sorting Test (WCST) demonstrating its has been widely studied across disciplines such as role in adaptive behavior [14]-[16]. Spiking Neural Networks psychology, cognitive neuroscience, and artificial intelligence (SNNs) have emerged as a biologically realistic approach to [1], [2]. While humans often shift between tasks seamlessly, modeling neural dynamics, particularly due to their ability to performance variations arise depending on prior experience, replicate synaptic plasticity mechanisms such as Spike Timing-task familiarity, and cognitive load. Understanding these processes Dependent Plasticity (STDP) [10], [17]. Prior studies have requires computational models that can capture the successfully applied SNNs to pattern recognition and classification underlying neural mechanisms driving adaptive control and tasks [18] and have modeled sensory processing systems decision-making. Empirical studies have identified increased like the mammalian olfactory system [19]. These findings neural activity in the cognitive control network, particularly in establish a computational foundation for implementing taskswitching the prefrontal cortex (PFC), when engaging in task-switching models with biologically grounded learning dynamics.
Hopfield Networks Meet Big Data: A Brain-Inspired Deep Learning Framework for Semantic Data Linking
Kannan, Ashwin Viswanathan, Thomas, Johnson P, Mukerji, Abhimanyu
The exponential rise in data generation has led to vast, heterogeneous datasets crucial for predictive analytics and decision-making. Ensuring data quality and semantic integrity remains a challenge. This paper presents a brain-inspired distributed cognitive framework that integrates deep learning with Hopfield networks to identify and link semantically related attributes across datasets. Modeled on the dual-hemisphere functionality of the human brain, the right hemisphere assimilates new information while the left retrieves learned representations for association. Our architecture, implemented on MapReduce with Hadoop Distributed File System (HDFS), leverages deep Hopfield networks as an associative memory mechanism to enhance recall of frequently co-occurring attributes and dynamically adjust relationships based on evolving data patterns. Experiments show that associative imprints in Hopfield memory are reinforced over time, ensuring linked datasets remain contextually meaningful and improving data disambiguation and integration accuracy. Our results indicate that combining deep Hopfield networks with distributed cognitive processing offers a scalable, biologically inspired approach to managing complex data relationships in large-scale environments.
Federated Granger Causality Learning for Interdependent Clients with State Space Representation
Mohanty, Ayush, Mohamed, Nazal, Ramanan, Paritosh, Gebraeel, Nagi
Advanced sensors and IoT devices have improved the monitoring and control of complex industrial enterprises. They have also created an interdependent fabric of geographically distributed process operations (clients) across these enterprises. Granger causality is an effective approach to detect and quantify interdependencies by examining how one client's state affects others over time. Understanding these interdependencies captures how localized events, such as faults and disruptions, can propagate throughout the system, possibly causing widespread operational impacts. However, the large volume and complexity of industrial data pose challenges in modeling these interdependencies. This paper develops a federated approach to learning Granger causality. We utilize a linear state space system framework that leverages low-dimensional state estimates to analyze interdependencies. This addresses bandwidth limitations and the computational burden commonly associated with centralized data processing. We propose augmenting the client models with the Granger causality information learned by the server through a Machine Learning (ML) function. We examine the co-dependence between the augmented client and server models and reformulate the framework as a standalone ML algorithm providing conditions for its sublinear and linear convergence rates. We also study the convergence of the framework to a centralized oracle model. Moreover, we include a differential privacy analysis to ensure data security while preserving causal insights. Using synthetic data, we conduct comprehensive experiments to demonstrate the robustness of our approach to perturbations in causality, the scalability to the size of communication, number of clients, and the dimensions of raw data. We also evaluate the performance on two real-world industrial control system datasets by reporting the volume of data saved by decentralization.